How to project a bipartite network? 
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The one-mode projecting is extensively used to compress the bipartite networks. Since the one- 
mode projection is always less informative than the bipartite representation, a proper weighting 
method is required to better retain the original information. In this article, inspired by the network- 
based resource-allocation dynamics, we raise a weighting method, which can be directly applied in 
extracting the hidden information of networks, with remarkably better performance than the widely 
used global ranking method as well as collaborative filtering. This work not only provides a creditable 
method in compressing bipartite networks, but also highlights a possible way for the better solution 
of a long-standing challenge in modern information science: How to do personal recommendation? 
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I. INTRODUCTION 



The last few years have witnessed a tremendous ac- 
tivity devoted to the understanding of complex networks 
[l|, S S S H, B H- A particular class of networks is 
the bipartite networks, whose nodes are divided into two 
sets, X and Y, and only the connection between two 
nodes in different sets is allowed (as illustrated in Fig. 
la). Many systems are naturally modeled as bipartite 
networks [8|: Human sexual network l9| is consisted of 
men and women, metabolic network [lO| is consisted of 
chemical substances and chemical reactions, etc. Two 
kinds of bipartite networks should be paid more atten- 
tion for their particular significance in social, economic 
and information systems. One is the so-called collabora- 
tion network, which is generally defined as a networks of 
actors connected by a common collaboration act [ll|i ■ 
Examples are numerous, includin g sc ientists connected 
by coauthoring a scientific paper [l3l . .movie actors 
connected by costarring the same movie [ll, [l^ , and so 
on. Moreover, the concept of collaboration network is 
not necessarily restricted within social systems (see, for 
example, recent reports on technological collaboration of 
software fl^ and urban traffic systems [13 )■ Although 
the collaboration network is usually displayed by the one- 
mode projection on actors (see later the definition), its 
fully representation is a bipartite network. The other 
one is named opinion network [Tsl . [Toj , where each node 
in the user-set is connected with its collected objects in 
the object-set. For example, listeners are connected with 
the music groups they collected from music-sharing li- 
brary (e.g. audioscrobbler.com) [l^, [2l|, web- users are 
connected with the webs they collected in a bookmark 
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site (e.g. delicious) [22j, customers are connected with 
the books they bought (e.g. Amazon.com) [2^. |2^. 

Recently, a large amount of attention is addressed to 
analyzing % HO,!!!, [H, [H] and modeling [H, [H [s^l 
bipartite network. However, for the convenience of di- 
rectly showing the relations among a particular set of 
nodes, the bipartite network is usually compressed by 
one-mode projecting. The one-mode projection onto X 
(X-projection for short) means a network containing only 
X-nodes, where two X-nodes are connected when they 
have at least one common neighboring K-node. Fig. lb 
and Fig. Ic show the resulting networks of X-projection 
and y-projection, respectively. The simplest way is to 
project the bipartite network onto an unweighted net- 
work [l^j Hi HU, [13, [1^ , without taking into account of 
the frequency that a collaboration has been repeated. Al- 
though some topological properties can be qualitatively 
obtained from this unweighted version, the loss of in- 
formation is obvious. For example, if two listeners has 
collected more than 100 music groups each (it is a typical 
number of collections, like in audioscrobbler.com, the av- 
erage number of collected music groups per listener is 140 
[20|). and only one music group is selected by both listen- 
ers, one may conclude that those two listeners probably 
have different music taste. On the contrary, if nearly 100 
music groups belong to the overlap, those two listeners 
are likely to have very similar habits. However, in the un- 
weighted listener-projection, this two cases have exactly 
the same graph representation. 

Since the one-mode projection is always less informa- 
tive than the original bipartite network, to better reflect 
structure of the network, one has to use the bipartite 
graph to quantify the weights in the projection graph. A 
straightforward way is to weight an edge directly by the 
number of times the corresponding partnership repeated 
[H, [HI . This simple rule is used to obtain the weights in 
Fig. lb and Fig. Ic for X-projection and y-projection, 
respectively. This weighted network is much more infor- 
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FIG. 1: Illustration of a bipartite network (a), as well as its 
X-projection (b) and F-projection (c). The edge- weight in 
(b) and (c) is set as the number of common neighbors in Y 
and X, respectively. 



mative than the unweighted one, and can be analyzed 
by standard techniques for unweighted graphs since its 
weights are all integers [s^. However, this method is 
also quantitatively biased. Li et al. [sTl empirically stud- 
ied the scientific collaboration networks, and pointed out 
that the impact of one additional collaboration paper 
should depend on the original weight between the two 
scientists. For example, one more co-authorized paper 
for the two authors having only co-authorized one pa- 
per before should have higher impact than for the two 
authors having already co-authorized 100 papers. This 
saturation effect can be taken into account by introduc- 
ing a hyperbolic tangent function onto the simple count 
of collaborated times [13 . As stated by Newman that 
two scientists whose names appear on a paper together 
with many other coauthors know one another less well 
on average than two who were the sole authors of a pa- 
per to consider this effect, he introduced the factor 
l/(n — 1) to weaken the contribution of collaborations in- 
volving many participants [3l,[39|, where n is the number 
of participants (e.g. the number of authors of a paper). 

How to weight the edges is the key question of the 
one-mode projections and their use. However, we lack a 
systematic exploration of this problem, and no solid base 
of any weighting methods have been reported thus far. 
For example, one may ask the physical reason why using 
the hyperbolic tangent function to address the satura- 
tion effect |i37] rather than other infinite possible candi- 
dates. In addition, for simplicity, the weighted adjacent 
matrix {wij} is always set to be symmetrical, that is, 
Wij = Wji. However, as in scientific collaboration net- 
works, different authors may assign different weights to 
the same co-authorized paper, and it is probably the case 
that the author having less publications may give a higher 
weight, vice versa. Therefore, a more natural weighting 
method may be not symmetrical. Another blemish in 
the prior methods is that the information contained by 
the edge whose adjacent X-node (F-node) is of degree 



one will be lost in Y-projection (X-projection). This 
information loss may be serious in some real opinion net- 
works. For example, in the user-web network of delicious 
(http://del.icio.us), a remarkable fraction of webs have 
been collected only once, as well as a remarkable fraction 
of users have collected only one web. Therefore, both the 
user-projection and web-projection will squander a lot of 
information. Since more than half publications in Math- 
ematical Reviews have only one author [sij , the situation 
is even worse in mathematical collaboration network. 

In this article, we propose a weighting method, with 
asymmetrical weights (i.e., Wij ^ wji) and allowed self- 
connection (i.e., Wii > 0). This method can be directly 
applied as a personal recommendation algorithm, which 
performs remarkably better than the widely used global 
ranking method (GRM) and collaborative filtering (CF). 



II. METHOD 

Without loss of generality, we discuss how to deter- 
mine the edge-weight in X-projection, where the weight 
Wij can be considered as the importance of node i in j's 
sense, and it is generally not equal to Wji. For exam- 
ple, in the book-projection of a customer-book opinion 
network, the weight Wij between two books i and j con- 
tributes to the strength of book i recommendation to a 
customer provided he has brought book j. In the sci- 
entific collaboration network, Wij reflects how likely is j 
to choose z as a contributor for a new research project. 
More generally, we assume a certain amount of a resource 
(e.g. recommendation power, research fund, etc.) is asso- 
ciated with each X-node, and the weight Wij represents 
the proportion of the resource j would like to distribute 
to i. 

To derive the analytical expression of Wij , we go back 
to the bipartite representation. Since the bipartite net- 
work itself is unweighted, the resource in an arbitrary 
X-node should be equally distributed to its neighbors in 
Y. Analogously, the resource in any F-node should be 
equally distributed to its X-neighbors. As shown in Fig. 
2a, the three X-nodes are initially assigned weights x, 
y and z. The resource- allocation process consists of two 
steps; first from X to Y, then back to X. The amount of 
resource after each step is marked in Fig. 2b and Fig. 2c, 
respectively. Merging these two steps into one, the final 
resource located in those three X-nodes, denoted by x' , 
y' and z' , can be obtained as: 

/ x'\ ( 11/18 1/6 5/18 \ ( x\ 

\y'\ = \ 1/9 5/12 5/12 \ \v\. (1) 

\z' ) \ 5/18 5/12 4/9 J \ z J 

Note that, this 3x3 matrix are column normalized, and 
the element in the ith row and jth column represents the 
fraction of resource the jth X-node transferred to the ith 
X-node. According to the above description, this matrix 
is the very weighted adjacent matrix we want. 
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FIG. 2: Illustration of the resource-allocation process in bi- 
partite network. The upper three are X-nodes, and the lower 
four are F-nodes. The whole process consists of two steps: 
First, the resource flows from X to Y (a— i-b), and then re- 
turns to X (b— >c). Different from the prior network-based 
resource-allocation dynamics [4^, the resource here can only 
flow from one node-set to another node-set, without consid- 
eration of asymptotical stable flow among one node-set. 



Now, consider a general bipartite network G{X, Y, E), 
where E is the set of edges. The nodes in X and Y 
are denoted by xi, 2:2, • ■ • , a;„ and yi,y2, ■ ■ ■ ,ymi respec- 
tively. The initial resource located on the ith X-node is 
f{xi) > 0. After the first step, all the resource in X flows 
to Y, and the resource located on the Zth y-node reads, 



(2) 



where k(xi) is the degree of Xi, and an is an n x to adja- 
cent matrix as 



1, Xiyi e E, 
0, otherwise. 



(3) 



In the next step, all the resource flows back to X, and 
the final resource located on reads, 



m m ^ f t \ 



This can be rewritten as 



(4) 
(5) 



^ m 



(6) 



which sums the contribution from all 2-step paths be- 



tween Xi and Xi. The matrix W 



}n> 



repre- 



sents the weighted X-projection we were looking for. The 
resource-allocation process can be written in the matrix 

form as /' = ly / . 

It is worthwhile to emphasize the particular characters 
of this weighting method. For convenience, we take the 
scientific collaboration network as an example, but our 
statements are not restricted to the collaboration net- 
works. Firstly, the weighted matrix is not symmetrical 
as 



(7) 



This is in accordance with our daily experience - the 
weight of a single collaboration paper is relatively small if 
the scientist has already published many papers (i.e., he 
has large degree), vice versa. Secondly, the diagonal ele- 
ments in W are nonzero, thus the information contained 
by the connections incident to one-degree Y-node will not 
be lost. Actually, the diagonal element is the maximal 
element in each column. Only if all Xi 's F- neighbors be- 
longs to Xj's neighbors set, wu = Wji. It is usually found 
in scientific collaboration networks, since some students 
coauthorize every paper with their supervisors. There- 
fore, the ratio Wji/wa < 1 can be considered as x^'s re- 
searching independence to Xj, the smaller the ratio, the 
more independent the researcher is, vice versa. The in- 
dependence of Xi can be approximately measured as 



E 



(8) 



Generally, the author who often publishes papers solely, 
or often publishes many papers with different coauthors 
is more independent. Note that, introducing the measure 
li here is just to show an example how to use the informa- 
tion contained by self- weight wu, without any comments 
whether to be more independent is better, or contrary. 



III. PERSONAL RECOMMENDATION 

The exponential growth of the Internet [41*1 and World- 
Wide- Web p3 | confronts people with an information 
overload: They are facing too many data and sources 
to be able to find out those most relevant for him. One 
landmark for information filtering is the use of search 
engines (43j . however, it can not solve this overload prob- 
lem since it does not take into account of personalization 
thus returns the same results for people with far different 
habits. So, if user's habits are different from the main- 
stream, it is hard for him to find out what he likes in the 
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FIG. 3: (color online) The predicted position of each entry in 
the probe ranked in the ascending order. The black, red and 
blue curves, from top to bottom, represent the cases of GRM, 
CF and NBI, respectively. The mean values are top 13.9% 
(GRM), top 12.0% (CF) and top 10.6% (NBI). 



countless searching results. Thus far, the most potential 
way to efficiently filter out the information overload is 
to recommend personally. That is to say, using the per- 
sonal information of a user (i.e., the historical track of 
this user's activities) to uncover his habits and to con- 
sider them in the recommendation. For instances, Ama- 
zon.com uses one's purchase history to provide individual 
suggestions. If you have bought a textbook on statistical 
physics, Amazon may recommend you some other sta- 
tistical physics books. Based on the well-developed Web 
2. technology [31 , the recommendation systems are fre- 
quently used in web-based movie-sharing (music-sharing, 
book-sharing, etc.) systems, web-based selling systems, 
bookmark web-sites, and so on. Motivated by the signif- 
icance in economy and society, recently, the design of an 
efficient recommendation algorithm becomes a joint focus 
from marketing practice [451 . l46j to mathematical analy- 
sis [431 , from engineering science [H, [i^, [s^ to physics 
community [51, 52, 53]. 

Basically, a recommendation system consists of users 
and objects, and each user has collected some objects. 
Denote the object-set as O = {01,02, • • • , o„} and user- 
set asU = {ux,U2, ■ ■ • , Um}- If users are only allowed to 
collect objects (they do not rate them), the recommenda- 
tion system can be fully described by an n x m adjacent 
matrix {uij}, where a^- = 1 if Uj has already collected o;, 
and fly = otherwise. A reasonable assumption is that 
the objects you have collected are what you like, and a 
recommendation algorithm aims at predicting your per- 
sonal opinions (to what extent you like or hate them) on 
those objects you have not yet collected. A more compli- 
cated case is the voting system [13, [55| , where each user 
can give ratings to objects (e.g., in the Yahoo Music, the 
users can vote each song with 5 discrete ratings repre- 
senting Never play again, It is ok, Like it. Love it, and 



Can't get enough), and the recommendation algorithm 
concentrates on estimating unknown ratings for objects. 
These two problems are closely related, however, in this 
article, we focus on the former case. 

Denote fc(oi) = Y^^=i ^ij ^^'^ degree of object o^. The 
global ranking method (GRM) sorts all the objects in the 
descending order of degree and recommends those with 
highest degrees. Although the lack of personalization 
leads to an unsatisfying performance of GRM (see nu- 
merical comparison in the next section) , it is widely used 
since it is simple and spares computational resources. For 
example, the well-known Yahoo Top 100 MTVs, Amazon 
List of Top Sellers, as well as the board of most down- 
loaded articles in many scientific journals, can be all con- 
sidered as results of GRM. 

Thus far, the widest applied personal recommendation 
algorithm is collaborative filtering (CF) [SO, [l^l, based 
on a similarity measure between users. Consequently, 
the prediction for a particular user is made mainly using 
the similar users. The similarity between users Ui and Uj 
can be measured in the Pearson-like form 



En 
1=1 O'liO'l] 

ii{k{u.i), k{uj)} ' 



(9) 



where k{ui) = X^/Li ^^"^ degree of user Ui. For any 
user-object pair Ui — Oj, if Ui has not yet collected Oj (i.e., 
0), by CF, the predicted score, (to what extent 



likes Oj), is given as 



Em 

m 



Sli 



(10) 



Two factors give rise to a high value of Vij. Firstly, if 
the degree of Oj is larger, it will, generally, have more 
nonzero items in the numerator of Eq. (10). Secondly, if 
Oj is frequently collected by users very similar to Ui, the 
corresponding items will be significant. The former pays 
respect to the global information, and the latter reflects 
the personalization. For any user Ui, all the nonzero Vij 
with aji = are sorted in descending order, and those 
objects in the top are recommended. 

We propose a recommendation algorithm, which is a 
direct application of the weighting method for bipartite 
networks presented above. The layout is simple: first 
compress the bipartite user-object network by object- 
projection, the resulting weighted network we label G. 
Then, for a given user Ui, put some resource on those 
objects already been collected by ui. For simplicity, we 
set the initial resource located on each node of G as 



(11) 



That is to say, if the object Oj has been collected by 
Ui, then its initial resource is unit, otherwise it is zero. 
Note that, the initial configuration, which captures per- 
sonal preferences, is different for different users. The ini- 
tial resource can be understood as giving a unit recom- 
mending capacity to each collected object. According to 
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the weighted resource-allocation process discussed in the 
prior section, the final resource, denoted by the vector 

/' , is /' — W f . Thus components of /' are 

n n 

/'(oj) ^^Wjifioi) ='^WjiaH. (12) 

1=1 1=1 

For any user Ui, all his uncollected objects Oj (1 < j < n, 
aji = 0) are sorted in the descending order of f'{oj), 
and those objects with highest value of final resource are 
recommended. We call this method network-based infer- 
ence (NBI) , since it is based on the weighted network G. 
Note that, the calculation of Eq. (12) should be repeated 
m times, since the initial configurations are different for 
different users. 



IV. NUMERICAL RESULTS 

We use a benchmark data-set, namely MovieLens, to 
judge the performance of described algorithms. The 
MovieLens data is downloaded from the web-site of Grou- 
pLens Research (http://www.grouplcns.org). The data 
consists 1682 movies (objects) and 943 users. Actually, 
MovieLens is a rating system, where each user votes 
movies in five discrete ratings 1-5. Hence we applied the 
coarse-graining method similar to what is used in Ref. 
[l9i]: A movie has been collected by a user iff the giv- 
ing rating is at least 3. The original data contains 10^ 
ratings, 85.25% of which are > 3, thus the user-movie 
bipartite network after the coarse gaining contains 85250 
edges. To test the recommendation algorithms, the data 
set (i.e., 85250 edges) is randomly divided into two parts: 
The training set contains 90% of the data, and the re- 
maining 10% of data constitutes the probe. The training 
set is treated as known information, while no information 
in probe set is allowed to be used for prediction. 

All three algorithms, GRM, CF and NBI, can provide 
each user an ordered queue of all its uncollected movies. 
For an arbitrary user Ui, if the edge Ui — Oj is in the 
probe set (according to the training set, Oj is an uncol- 
lected movie for Ui), we measure the position of Oj in the 
ordered queue. For example, if there are 1500 uncollected 
movies for Ui, and Oj is the 30th from the top, we say the 
position of oj is the top 30/1500, denoted by = 0.02. 
Since the probe entries are actually collected by users, 
a good algorithm is expected to give high recommenda- 
tions to them, thus leading to small r. The mean value 
of the position value, averaged over entries in the probe, 
are 0.139, 0.120 and 0.106 by GRM, CF and NBI, respec- 
tively. Fig. 3 reports the distribution of all the position 
values, which are ranked from the top position (r —^ 0) 
to the bottom position (r ^ 1). Clearly, NBI is the best 
method and GRM performs worst. 

To make this work more relevant to the real-life recom- 
mendation systems, we introduce a measure of algorith- 
mic accuracy that depends on the length of recommen- 
dation list. The recommendation list for a user Ui, if of 
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FIG. 4: The hitting rate as a function of the length of recom- 
mendation list. The black, red and blue curves, from bottom 
to top, represent the cases of GRM, CF and NBI, respectively. 



TABLE I: The hitting rates for some typical lengths of rec- 
ommendation list. 



Length 


GRM 


CF 


NBI 


10 


10.3% 


14.1% 


16.2% 


20 


16.9% 


21.6% 


24.8% 


50 


31.1% 


37.0% 


41.2% 


100 


45.2% 


51.0% 


55.9% 



length L, contains L highest recommended movies result- 
ing from the algorithm. For each incident entry Ui — Oj 
in the probe, if Oj is in m^'s recommendation list, we say 
the entry Ui — Oj is hit by the algorithm. The ratio of 
hit entries to the population is named hitting rate. For a 
given L, the algorithm with a higher hitting rate is bet- 
ter, and vice versa. If L is larger than the total number 
of uncollected movies for a user, the recommendation list 
is defined as the set of all his uncollected movies. Clearly, 
the hitting rate is monotonously increasing with L, with 
the upper bound 1 for sufficiently large L. In Fig. 4, we 
report the hitting rate as a function of L for different al- 
gorithms. In accordance with Fig. 3, the accuracy of the 
algorithms is NBI > CF > GRM. The hitting rates for 
some typical lengths of recommendation list are shown 
in Table I. 

In a word, via the numerical calculation on a bench- 
mark data set, we have demonstrated that the NBI 
has remarkably better performance than GRM and CF, 
which strongly guarantee the validity of the present 
weighting method. 



V. CONCLUSION AND DISCUSSION 

Weighting of edges is the key problem in the construc- 
tion of a bipartite network projection. In this article 
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we proposed a weighting method based on a resource- 
allocation process. The present method has two promi- 
nent features. First, the weighted matrix is not symmet- 
rical, and the node having larger degree in the bipartite 
network generally assigns smaller weights to its incident 
edges. Second, the diagonal element in the weighted ma- 
trix is positive, which makes the weighted one-mode pro- 
jection more informative. 

Furthermore, we proposed a personal recommendation 
algorithm based on this weighting method, which per- 
forms much better than the widest used global ranking 
method as well as the collaborative filtering. Especially, 
this algorithm is tune- free (i.e., does not depend on any 
control parameters), which is a big advantage for poten- 
tial users. The main goal of this article is to raise a 
new weighting method, as well as provide a bridge from 
this method to the recommendation systems. The pre- 
sented recommendation algorithm is just a rough frame- 
work, whose details have not been exhaustively explored 
yet. For example, the setting of the initial configura- 
tion may be oversimplified, a more complicated form, like 
f{oj) = ajik^{oj), may lead to a better performance than 
the presented one with (3 = 0. One is also encouraged 
to consider the asymptotical dynamics of the resource- 
allocation process [40| , which can eventually lead to some 
certain iterative recommendation algorithms. Although 
such an algorithm require much longer CPU time, it may 
give more accurate prediction than the present algorithm. 

If we denote (fc^) and (fco) the average degree of users 
and objects in the bipartite network, the computational 
complexity of CF is 0(m^(fc„) + mn{ko)), where the first 
term accounts for the calculation of similarity between 
users (see Eq. (9)), and the second term accounts for the 
calculation of the predicted score (see Eq. (10)). Sub- 
stituting the equation n{ko) = m{ku), we are left with 
0{m^{ku))- The computational complexity for NBI is 



0(m(fc^) -I- mn{ku)) with two terms accounting for the 
calculation of the weighted matrix and the final resource 
distribution, respectively. Here (fc^) is the second mo- 
ment of the users' degree distribution in the bipartite 
network. Clearly, (fc,^) < n(A:„), thus the resulting form 
is 0{mn{ku)). Note that the number of users is usually 
much larger than the number of objects in many recom- 
mendation systems. For instance, the EachMovie dataset 
provided by the Compaq company contains m = 72916 
users and n = 1628 movies, and the Netflix company 
provides nearly 20 thousands online movies for million 
users. It is also the case of music-sharing systems and 
online bookstores, the number of registered users is more 
than one magnitude larger than that of the available ob- 
jects (e.g., music groups, books, etc.). Therefore, NBI 
runs much fast than CF. In addition, NBI requires 
memory to store the weighted matrix {wij}, while CF 
requires memory to store the similarity matrix {sij}. 
Hence, NBI is able to beat CF in all the three criteri- 
ons of recommendation algorithm: accuracy, time and 
space. However, in some recommendation systems, as in 
bookmark sharing websites, the number of objects (e.g. 
webpages) is much larger than the number of users, thus 
CF may be more practicable. 
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